A Punjabi to Hindi Machine Transliteration System

نویسنده

  • Gurpreet Singh Josan
چکیده

Transliteration is the general choice for handling named entities and out of vocabulary words in any MT application, particularly in machine translation. Transliteration (or forward transliteration) is the process of mapping source language phonemes or graphemes into target language approximations; the reverse process is called back transliteration. This paper presents a novel approach to improve Punjabi to Hindi transliteration by combining a basic character to character mapping approach with rule based and Soundex based enhancements. Experimental results show that our approach effectively improves the word accuracy rate and average Levenshtein distance of the various categories by a large margin.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web Based Hindi to Punjabi Machine Translation System

Hindi and Punjabi are closely related languages with lots of similarities in syntax and vocabulary Both Punjabi and Hindi languages have originated from Sanskrit which is one of the oldest language. In terms of speakers, Hindi is third most widely spoken language and Punjabi is twelfth most widely spoken language. Punjabi language is mostly used in the Northern India and in some areas of Pakist...

متن کامل

Hindi to Punjabi Transliteration using Phonetic and Orthographic Rules

One of the important applications of Natural Language Processing is machine translation. Machine transliteration is an emerging and a very important research area in the field of machine translation. Translation systems translate message from source language to target language, keeping the exact meaning. While the transliteration system finds the same meaning word/sentence in another language, ...

متن کامل

Punjabi Machine Transliteration

Machine Transliteration is to transcribe a word written in a script with approximate phonetic equivalence in another language. It is useful for machine translation, cross-lingual information retrieval, multilingual text and speech processing. Punjabi Machine Transliteration (PMT) is a special case of machine transliteration and is a process of converting a word from Shahmukhi (based on Arabic s...

متن کامل

Statistical Approach to Transliteration from English to Punjabi

-Machine transliteration plays an important role in natural language applications such as information retrieval and machine translation, especially for handling proper nouns and technical terms. Transliteration is a crucial factor in CLIR and MT. It is important for Machine Translation, especially when the languages do not use the same scripts. This paper addresses the issue of statistical mach...

متن کامل

Development of a Punjabi to English Transliteration System

DEVELOPMENT OF A PUNJABI TO ENGLISH TRANSLITERATION SYSTEM Kamal Deep1 and Vishal Goyal2 1Department of Computer Science, Punjabi University, Patiala, India E-mail: [email protected] 2Assistant Professor, Department of Computer Science, Punjabi University, Patiala, India E-mail: [email protected] Machine transliteration has gained prime importance as a supporting tool for Machine translat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJCLCLP

دوره 15  شماره 

صفحات  -

تاریخ انتشار 2010